Nonconcave entropies from generalized canonical ensembles 
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It is well-known that the entropy of the microcanonical ensemble cannot be calculated as the 
Legendre transform of the canonical free energy when the entropy is nonconcave. To circumvent this 
problem, a generalization of the canonical ensemble which allows for the calculation of nonconcave 
entropies was recently proposed. Here, we study the mean-field Curie-Weiss-Potts spin model and 
show, by direct calculations, that the nonconcave entropy of this model can be obtained by using a 
specific instance of the generalized canonical ensemble known as the Gaussian ensemble. 
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Systems involving long-range interactions, such as 
gravitational forces or wave-particle interactions, have 
many unusual properties at equilibrium which set these 
systems apart from those involving short-range interac- 
tions The root of most, if not all, of these unusual 
properties can be traced back to the fact that the mi- 
crocanonical entropy function of long-range systems can 
be nonconcave as a function of the energy, even in the 
thermodynamic limit. By constrast, short-range systems 
can have nonconcave entropies at finite volume, but any 
nonconcave parts of the entropy must and will disappear 
in the thermodynamic limit, leaving a thermodynamic 
entropy which is always concave This has profound 
physical consequences because the concavity of the en- 
tropy determines ultimately whether the microcanonical 
ensemble (ME) is equivalent with the canonical ensemble 
(CE). If the entropy is concave, then the two ensembles 
are equivalent; otherwise they are nonequivalent at both 
the thermodynamic and macrostate levels [J, |5j . Thus 
short-range systems always have equivalent ensembles, 
but long-range systems need not; they can have equilib- 
rium properties within the ME that have no counterparts 
in the CE. Many illustrations of this possibility have been 
given recently for a variety of systems, including those 
arising in the study of gravitation 0, [t| , geostrophic tur- 
bulence plasmas and magnetism 0,0,0,0]- 

Systems with nonconcave entropies are special not only 
in terms of their equilibrium properties, but also in the 
way in which their entropy can be calculated. In cases 
of ensemble equivalence, it is well known that the mi- 
crocanonical entropy function can be calculated as the 
Legendre transform of the canonical free energy func- 
tion. This way of calculating the entropy goes back to 
Gibbs, and is nowadays the way of choice for calculat- 
ing entropies because of the many practical advantages 
it offers. One well known advantage is that calculations 
carried out at the level of the free energy are generally 
more tractable than those carried out at the level of the 
entropy directly. Another advantage is that the definition 
of the free energy allows for many approximation schemes 



(e.g., perturbative expansions or variational principles), 
which are not available in the microcanonical ensemble. 

The problem for systems having a nonconcave entropy 
is that the entropy does not correspond to the Legendre 
transform of the free energy anymore. This is obvious if 
one recalls that Legendre transforms yield only concave 
functions. Therefore, if one knows or suspects that a 
given system has a nonconcave entropy, then one also 
knows that this entropy cannot be calculated in the CE 
as the Legendre transform of the free energy. In this case, 
one must rely on the ME to perform that calculation (see, 
e.g., 00). 

Our goal in this paper is to illustrate an alternative 
method for calculating entropy functions which goes be- 
yond the CE in that it can be used to obtain nonconcave 
entropies. The method is not based on the ME. Rather, 
it is based on a generalized canonical ensemble (GCE) 
put forward recently by us, and works by modifying the 
structure of the Legendre transform through the use of 
a generalization of the free energy function. We have al- 
ready presented the theory of the GCE and of its equiva- 
lence with the ME in |15lll6| : our goal here is to illustrate 
this theory with the help of a simple spin model known as 
the mean-field Curie- Weiss-Potts model 

[ISO- For this 

model, we shall calculate a nonconcave entropy function 
using the GCE, and show that the result agrees, in some 
appropriate limit, with the one obtained in the ME. In 
doing so, we shall explain a number of practical aspects 
of the GCE which have not been fully discussed before. 

To begin, we review the definition of the GCE and 
the results establishing equivalence of this ensemble with 
the ME. Following our previous work 0,0], we define 
the GCE by the following probability density over the 
microstates lo: 

p — a.nh{uj) —ng(h(uj)) 



In this expression, h(u>) = H{ui)/n is the energy per par- 
ticle or mean energy, a is a real parameter, g(h) is a 
continuous but otherwise arbitrary function of h, and 
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Z n g (a) is a normalization constant given by 

Z n , g {a) = J e -^M-»9(''M) (iw . ( 2 ) 

This function is naturally interpreted as a generalized 
partition function; from it, we define a generalized free 
energy via the usual limit 

ip g (a)=- lim - In Z n<g (a). (3) 

It is obvious that P n ,g,a(u) and, consequently, Z n ^ g {a) 
and ip g (a) reduce to their standard canonical-ensemble 
expressions for 5 = |l7|. This explains why we use the 
term "generalized" CE. 

To complete this short review of the GCE, we state the 
generalized Legendre transform that will be used here- 
after to obtain the entropy of the ME from the free energy 
of the GCE. At this point, we recall the thermodynamic- 
limit definition of the microcanonical entropy: 

s(u) = lim — In / 8(h(uo) — u) du>. (4) 

n^oo n J 

In terms of <£g(oi) we then have the following: //, for a 
given choice of g, tp g {ot) is differ entiable at a, then 

s(u g , a ) = a%, Q + g(u g , a ) - f g {a), (5) 

where u giCl = (p' g (a) (see Theorem 4). The choice 
g = yields, as expected, the standard Legendre trans- 
form 

s(up) = /3up i- <p(f3), up = <p'{p), (6) 

where <p(p) — <p g =o(a — P) stands for the free energy of 
the CE. For the remaining, it is useful to note that ii ffiQ , 
represents the equilibrium mean energy of the GCE with 
function g and parameter a flj| . This parallels the case of 
the CE for which up — ip'(P) represents the equilibrium 
mean energy of the CE with inverse temperature p. 

We now come to the main point of this paper, which 
is to consider a system known to have a nonconcave en- 
tropy, and show that the system's entropy can be derived 
from the sole point of view of the GCE by first calculat- 
ing cp g (a) for that system and then by applying to (p g (a) 
the generalized Legendre transform shown in (jSJ). The 
model that we consider for this purpose is the q-state 
mean-field Curie- Weiss-Potts (CWP) model [13 111 13 
defined by the Hamiltonian 

n 

i,k=l 

In this expression, 5(x, y) is the Kronecker symbol, u>i is a 
spin variable taking values in the set A = {9 1 , 6* 2 , . . . , 9 q }, 
and us represents the complete configuration of n spins, 
i.e., u> = (u>x, u)2, ■ ■ ■ , us n ). For simplicity, we consider 



the case q = 3, so that spins can only take one of three 
possible values: 1 , 9 2 and 9 3 . 

Despite the rather simple nature of the CWP model, a 
complete, analytical calculation of its generalized free en- 
ergy seems out of reach. For a start, the integral defining 
the generalized partition Z„ j9 (a) does not seem to be ex- 
plicitly solvable, so that the calculation of f g {ot) cannot 
proceed from a direct evaluation of Eqs. 10) and J2J. For- 
tunately, there is an alternative way to calculate <p g {a) 
suggeste d by large deviation techniques. It involves three 
steps [illif: 

(i) Rewrite the energy per spin h(ui) = H(u>)/n as a 
function of some macrostate or order parameter v(ui). In 
our case, we choose the macrostate to be the empirical 
vector v = (i>\, v^, vs), the jth component of which is 
defined by 

n 

fc=l 

A short calculation shows indeed that h(u>) = h(u(u>)), 
where h{v) = —\{v^v). The component Vj represents 
the relative number of spins in u equal to ; < Vj < 1, 
Vj = 1. For this reason, v(uj) is often called the sta- 
tistical distribution of spin state or spin distribution for 
short. The function h is called the energy representation 
function. 

(ii) Derive the expression of an entropy function S(y) 
for the macrostate v. In the case of the empirical vector, 
that entropy function is well known to be given by the 
statistical (Boltzmann-Shannon) entropy 

3 

s(u) = -^Vjlnvj. (9) 

j'=i 

(iii) Calculate tp g (a) , finally, using the following repre- 
sentation formula: 

<p g {a) - if{ah{y)+g{h{u)) - ~s{v)) . (10) 

The infimum is evaluated over all allowed values of v 1 
i.e., all triplets v = (yi, ^2, ^3) such that < Vj < 1 and 

Equation l|10|) was solved numerically using a quadratic 
function g of the form g(u) = 7u 2 /2, where 7 > [20| . 
This choice of g defines a GCE known as the Gaussian 
ensemble (GE) f21]. Fig. 2] shows the result of the nu- 
merical computation for three increasing values of the 
parameter 7. For each value, the free energy of the GE, 
which we denote by ip 7 (a), has the particularity that it 
possesses one nondiffcrcntiablc point, located at a 7 , sep- 
arating two differentiable branches. The left branch is a 
linear function of a with a slope independent of 7. The 
value of the slope is equal to —1/6, which is the maxi- 
mum admissible value of h [22^. To obtain the value of 
s(u) at this specific value of h, which we denote by u max , 
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FIG. 1: Gaussian free energy of the CWP model for different 
values of 7. 



we simply need at this point to apply Eq. © to the left 
branch of <£> 7 (a). The result obtained is s(it max ) = In 3, 
which is the correct value found in the ME corresponding 
physically to the entropy of the excited state of the CWP 
model. The right, differentiable branch of y 7 (a) can be 
treated similarly with the difference that <^ 7 (cv) is not a 
constant but takes values in a range of the form [u m i n , u 7 ], 
where u m - m = tp'(oc) = — 1/2 and u 7 = </? 7 (ct 7 + 0). The 
calculation of Eq. JSJ for the right branch of ip 1 there- 
fore yields s(u) in that range. The result is displayed in 
Fig. EI a) together with the exact entropy function s(u) 
derived directly from the ME (see Eq. (4.3) in 0). Note 
that, in order to accentuate the relatively shallow non- 
concave region of s(u) near it max , we have plotted the 
derivative of the entropy in Fig. [21 a) rather than the en- 
tropy itself. 

The comparison in Fig. Efa) of the entropy obtained 
in the GE with the "true" entropy of the ME shows that 
the use of the GE enables us to obtain s(u) in a gradual 
manner by increasing the value of the parameter 7. For 
7 = 0, s(u) is recovered from the ground-state mean en- 
ergy Mmin = — 1/2 up to the value u 7= o = —1/4, while for 
7 = 5, s(u) is recovered from u m i n to U5 w —0.1883 > uq. 
Increasing the parameter 7 further to 7 = 20, we obtain 
s(u) from u m in up to U20 ~ —0.1706 > U5. The case 
7 = corresponds to the CE, so that the part of s(u) ob- 
tained in this ensemble corresponds to the concave part 
of s(u) determined by Maxwell's construction or, equiva- 
lently, by the set of supporting lines of that function; see 
[lif for details. Thus, we see that a virtue of the GE with 
7 > over the CE is that the former ensemble is able to 
recover nonconcave points of the entropy function while 
the latter is not. 

By continuing to increase 7, we can calculate the value 



it 7 up to which the GE is able in to recover s(u). The 
results are shown in Fig.[3fb), as well as in Fig.|2Jc). The 
conclusion that we reach from these two figures is that the 
GE recovers the complete entropy function of the CWP 
model in the limit 7 — > 00, since u 7 — » u max = —1/6 in 
that limit. Hence, for a finite, positive value of 7, the GE 
recovers only a part of s(u), but the part of s(u) "missed" 
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FIG. 2: (a) Derivative of the entropy function found for the 
CWP model using the GE (black line) and ME (gray line). 
The dashed lines show the mean energy value u~, up to which 
there is equivalence between the GE and ME. (b) Convergence 
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by the GE can be made arbitrary small by choosing large 
enough values of 7. In this sense, we say that the GE 
and the ME are asymptotically equivalent. In general, we 
know that the GE should be equivalent with the ME at 
all values of h whenever 7 is greater than the maximum 
value of s"(u) (see ^5|, Theorem 5.2). In the case of the 
CWP, s"(u) diverges at u max , which explains why the 
two ensembles become equivalent for all u £ [w m in, Umax] 
only in the limit 7 — > 00. An analytical study of this 
limit is presented in [23| (see also [lilllr^ ). 

We can go further in our study of the CWP model by 
showing that the GE can be used not only to calculate 
s(u) but also to calculate the microcanonical equilibrium 
values of v of that model. This aspect of the GE, which 
we refer to as the macrostate level of ensemble equiva- 
lence, is illustrated in Fig. |21 This figure shows two sets 
of plots. The one on the left shows the equilibrium values 
of v calculated in the GE as a function of a 24], while 
the one on the right displays the same points but now 
as a function of their mean energy u = h{v) (black line). 
Note that only the first component of v is plotted be- 
cause the equilibrium value of v in the GE has the form 
v = (a,b,b) |23. The comparison with the ME is estab- 
lished by plottin g th e equilibrium value of a in the ME 
see Eq. (4.1) in jl3j) for all ] (gray line). 

The results obtained from these calculations show that 
the GE recovers the equilibrium macrostates of the ME 
for all u £ [wmin,u 7 ], with u 1 approaching u max for in- 
creasing 7. The value u 7 here is the same as the one 
found previously when calculating s(u), so that the range 
of mean-energy values for which we have equivalence of 
ensembles at the level of v is exactly the range over which 
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FIG. 3: (a) First component a of the equilibrium value of v 
in the GE as a function of a. (b) Equilibrium value of a in 
the GE as a function of the equilibrium mean energy of that 
ensemble (black line) ; equilibrium value of a in the ME (gray 
line). The dashed lines show the mean energy value it 7 up to 
which there is equivalence between the GE and ME. 



we have equivalence of ensembles at the thermodynamic 
level, i.e., the level of s(u). This is not a coincidence but 
a direct result of the fact that equivalence of the GE and 
ME at the thermodynamic level entails, essentially, the 
equivalence of these ensembles at the macrostate level 
[TiLll^ . As a result, we can conclude that the GE recov- 
ers, in the limit 7 — > 00, the microcanonical equilibrium 
values of v for all u e 



l mini " max 



], since it completely 
recovers s(u) in the same limit. 

In the end, it is interesting to note that the asymp- 
totic equivalence of the GE and ME is reflected, at the 
macrostate level, by the fact that the jump of a, seen in 
the GE as a function of a, disappears as 7 — > 00 (Fig.|3J). 
It can be proved in general that the disappearance of 
this jump, which is responsible for the nondifferentiable 
point of ifj (a) (Fig. , is a sufficient condition for the 
complete equivalence of the GE and ME (see 0] , Theo- 
rem 4). From a physical point of view, this means that 
the absence of a first-order phase transition in the GE is 
a sufficient condition for the equivalence of the GE and 
ME. 

To summarize, we have considered a mean-field ver- 
sion of the Potts model to show that the nonconcave 
entropy function of this spin model can be calculated us- 
ing a generalization of the canonical ensemble known as 
the Gaussian ensemble. The large deviation formalism 
that we have used to study this model is totally gen- 
eral in that it can be applied to other models known to 
have nonconcave entropies. Our future work will aim at 
studying a number of these models (see, e.g., [13)) m 
addition to studying other types of generalized canoni- 
cal ensembles, including the one defined by the function 



g(u) = j\u\. Although the Gaussian ensemble is thought 
to be universal, in the sense that it should be able to 
recover any form of entropy function 0, , other gen- 
eralized ensembles could be useful, in that they may lead 
to more tractable calculations than those carried out in 
the Gaussian ensemble (see, e.g., |26j). 
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